A Shrinking-Based Dimension Reduction Approach for Multi-Dimensional Data Analysis

نویسندگان

  • Yong Shi
  • Aidong Zhang
چکیده

In this paper, we present continuous research on data analysis based on our previous work on the shrinking approach. Shrinking[2] is a novel data preprocessing technique which optimizes the inner structure of data inspired by the Newton’s Universal Law of Gravitation[1] in the real world. It can be applied in many data mining fields. Following our previous work on the shrinking method for multidimensional data analysis in full data space, we propose a shrinking-based dimension reduction approach which tends to solve the dimension reduction problem from a new perspective. In this approach data are moved along the direction of the density gradient, thus making the inner structure of data more prominent. It is conducted on a sequence of grids with different cell sizes. Dimension reduction process is performed based on the difference of the data distribution projected on each dimension before and after the datashrinking process. Those dimensions with dramatic variation of data distribution through the data-shrinking process are selected as good dimension candidates for further data analysis. This approach can assist to improve the performance of existing data analysis approaches. We demonstrate how this shrinking-based dimension reduction approach affects the clustering results of well known algorithms.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Dimension Reduction Approach Using Shrinking for Multi-Dimensional Data Analysis

In this paper, we present ongoing research on data analysis based on our previous work on the shrinking approach. Shrinking [22] is a novel data preprocessing technique which optimizes the inner structure of data. It can be applied in many data mining fields. Following our previous work on the shrinking method for multi-dimensional data analysis in full data space, we propose a shrinking-based ...

متن کامل

A Shrinking-Based Approach for Multi-Dimensional Data Analysis

Existing data analysis techniques have difficulty in handling multi-dimensional data. In this paper, we first present a novel data preprocessing technique called shrinking which optimizes the inner structure of data inspired by the Newton’s Universal Law of Gravitation[22] in the real world. This data reorganization concept can be applied in many fields such as pattern recognition, data cluster...

متن کامل

Principal Component Multi Linear Analysis for Content Based Image Retrieval

In the process of content based Image retrieval (CBIR), image information is presented in descriptive features to obtain retrieval of image information. In the representation of descriptive features a large feature count is observed, which results in the overhead in processing. To reduce these descriptive features different dimensional reduction logic were used in which PCA is the most commonly...

متن کامل

Differenced-Based Double Shrinking in Partial Linear Models

Partial linear model is very flexible when the relation between the covariates and responses, either parametric and nonparametric. However, estimation of the regression coefficients is challenging since one must also estimate the nonparametric component simultaneously. As a remedy, the differencing approach, to eliminate the nonparametric component and estimate the regression coefficients, can ...

متن کامل

A Chance Constraint Approach to Multi Response Optimization Based on a Network Data Envelopment Analysis

In this paper, a novel approach for multi response optimization is presented. In the proposed approach, response variables in treatments combination occur with a certain probability. Moreover, we assume that each treatment has a network style. Because of the probabilistic nature of treatment combination, the proposed approach can compute the efficiency of each treatment under the desirable reli...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004